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Abstract 

In quantitative finance, we often fit a parametric semimartingale 
model to asset prices. To ensure our model is correct, we must then 
perform goodness-of-fit tests. In this paper, we give a new goodness-of- 
fit test for volatility-like processes, which is easily applied to a variety 
of semimartingale models. In each case, we reduce the problem to the 
detection of a semimartingale observed under noise. In this setting, we 
then describe a wavelet-thresholding test, which obtains adaptive and 
near-optimal detection rates. 


1 Introduction 

In quantitative finance, we often model asset prices as semimartingales; in 
other words, we assume prices are given by a sum of drift, diffusion and jump 
processes. As these models can be difficult to fit to data, we often restrict 
our attention to a parametric class, of which many have been suggested by 
practitioners. To verify our choice of parametric class, we must then perform 
goodness-of-fit tests. 

As semimartingale models can be quite complex, there are many po¬ 
tential tests to perform. In the following, we will be interested in testing 
whether models accurately describe processes such as the volatility, covolatil¬ 
ity, vol-of-vol or leverage. We will further be looking for tests which can be 
shown to obtain good rates of detection against a variety of alternatives. 

While many goodness-of-fit tests exist in the literature, fewer have been 
shown to obtain good detection rates. Those tests which do achieve good 
rates are generally designed for one type of semimartingale model, and one 
way of measuring performance. 

In the following, we will therefore describe a new goodness-of-fit test for 
volatility-like processes in semimartingales. Our test can easily be applied 
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to a wide range of models, including stochastic volatility, jumps and mi¬ 
crostructure noise, and obtains good detection rates against both local and 
nonparametric alternatives. 

Our method involves reducing any goodness-of-fit test to one of semi¬ 
martingale detection: given a series of observations, is the series white noise, 
or does it contain a hidden semimartingale? We will show how this prob¬ 
lem can be solved efficiently, obtaining adaptive and near-optimal detection 
rates. 

We now describe in more detail the problems we consider, as well as 
relevant previous work. Our goal will be to test the goodness-of-fit of a 
parametric semimartingale model. Many such models have been described, 
including simple models such as Black-Scholes or Cox-Ingersoll-Ross; Levy 
models such as the generalised hyperbolic or CGMY processes; and stochas¬ 
tic volatility models such as the Heston or Bates models. (For definitions, 
see Cont and Tankov, 2004; Papapantoleon, 2008.) 

In the simplest case, where our observations are known to come from a 
stationary or ergodic diffusion process, a great many authors have described 
goodness-of-fit tests. We brieffy mention some initial work (Ait-Sahalia, 
1996; Corradi and White, 1999; Kleinow, 2002) as well as more recent discus¬ 
sion (Gonzalez-Manteiga and Grujeiras, 2013; Papanicolaou and Giesecke, 
2014; Ghen et ah, 2015). 

In a financial setting, however, even if our model is stationary, we may 
need to test it against non-stationary alternatives. When observations can 
come from a non-stationary diffusion, goodness-of-fit tests have been de¬ 
scribed using the integrated volatility (Gorradi and White, 1999), estimated 
residuals (Lee, 2006; Lee and Wee, 2008; Nguyen, 2010) and marginal density 
(Ait-Sahalia and Park, 2012). Goodness-of-fit tests also exist for regressions 
between diffusions (Mykland and Zhang, 2006). 

In the following, we will be interested in goodness-of-fit tests which 
not only detect non-stationary alternatives, but also achieve good detec¬ 
tion rates. In this setting, Dette and von Lieres und Wilkau (2003) propose 
a test which can detect misspecification of the volatility at a rate in 

norm (see also Dette et ah, 2006; Podolskij and Ziggel, 2008; Papanicolaou 
and Giesecke, 2014). 

A similar test proposed by Dette and Podolskij (2008) detects alterna¬ 
tives in a fixed direction at the faster rate although the authors do 

not give rates in L^. This test can also be applied to more complex mod¬ 
els, including stochastic volatility (Vetter, 2012) and microstructure noise 
(Vetter and Dette, 2012). 

In some volatility testing problems, previous work has described tests 
which achieve optimal detection rates against nonparametric alternatives 
(ReiB et ah, 2014; Bibinger et ah, 2015). However, these tests are specific 
to the problems considered, and do not assess the goodness-of-fit of general 
models. 


2 


In the following, we will therefore describe a new method of goodness-of- 
fit testing for volatility-like processes. We will show how our approach ap¬ 
plies to a wide variety of semimartingale models, including those with jumps, 
stochastic volatility and microstructure noise. In each case, we will obtain 
adaptive detection rates, with near-optimal behaviour not only against al¬ 
ternatives in a fixed direction, but also against nonparametric alternatives. 

To construct our tests, we will reduce each goodness-of-fit problem to 
one of semimartingale detection: we will construct a series of observations 
Zj, which under the null hypothesis are approximately white noise, and then 
test whether the Zj contain a hidden semimartingale St- 
For example, suppose we have a semimartingale 

dXt = btdt + dBt, 

where Bt is a Brownian motion, bt and fit are predictable processes, and 
we make observations W-, f = 0,... ,n, where the times ti := i/n. Further 
suppose we have a model fj,{t,Xt) for the volatility, and wish to test the 
hypotheses 


Hq : fit = fi{t, Xt) vs. Hi : fit unrestricted. 

To estimate fit, we define the realised volatility estimates 

y* := n(Wi+i - f = 0, ...,n-l. 

Since the scaled increments ^/n{Xt^^-^ — Xt^) are approximately N{0,fit^), 
the observations Yt have approximate mean flt^ and variance . Under 
Hq, we thus have that the normalised observations 

Zi := (Vi - ^(U, W,))/cr(U,XtJ, cr^ ;= 2fi‘^, 

are approximately white noise. 

Under Hi, we instead obtain 


Zi — Sti -\- Si 


( 1 ) 


where the semimartingale 

St ■= (fit- fi{t,Xt))/a{t,Xt), 

and the approximately-centred noises 

Si := {Yi- fitO/a{ti,XtO. 

To test our hypotheses, we must therefore test whether the the series Zj is 
approximately white noise, or contains a hidden semimartingale St- 

If the noises Si were independent standard Gaussian, independent of 
St, we could consider this a standard detection problem in nonparametric 
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regression. Conditioning on St, we could take the semimartingale as fixed, 
and then apply the methods of Ingster and Suslina (2003), for example. 

Under suitable assumptions on the process St, its sample paths would 
be almost ^-smooth, and we would thus be able to detect a signal St at rate 
77 ,-i/ 4 supremum norm, up to log terms. Alternatively, if we wished to 
detect signals St oc et, for a fixed direction et, we could do so at a rate 

In general, however, the signal St may depend on past values of the 
noises Si, and vice versa. We will thus not be able to appeal directly to 
results in nonparametric regression, and will instead need to use arguments 
developed specifically for the semimartingale setting. 

In the following, we will show that testing problems like (1) can be 
solved with detection rates similar to those of nonparametric regression. 
We will further show that many semimartingale goodness-of-fit tests can 
be described in a form like (1), including models with stochastic volatility, 
jumps or microstructure noise. 

Our approach will be similar to wavelet thresholding (Donoho et ah, 
1995; Hoffmann et ah, 2012); essentially, we will reject the null whenever a 
suitable wavelet-thresholding estimate of St is non-zero. While this method 
is known to work well in the standard nonparametric setting, we will need 
to prove new results to apply it to settings like (1). 

Our proofs will use a Gaussian coupling derived from Skorokhod embed¬ 
dings. We note that as our results must apply in a general semimartingale 
setting, we will not be able to use faster-converging couplings, such as the 
KMT approximation. We will show, however, that under reasonable mo¬ 
ment bounds, a Skorokhod embedding will suffice to achieve the desired 
detection rates. 

Indeed, with this construction we will show our tests detect semimartin¬ 
gales St at a rate in supremum norm, up to log terms, even when St 

contains finite-variation jumps. Furthermore, our tests will simultaneously 
detect simpler signals at faster rates; for example, we will be able to de¬ 
tect signals St in a fixed direction et at a rate up to logs, without 

knowledge of the direction et- 

We will finally show that in each case, the rates obtained are near- 
optimal. Applying our tests to problems like (1), we will thus be able to 
construct goodness-of-fit tests for a wide variety of semimartingale models, 
obtaining adaptive and near-optimal detection rates. 

The paper will be organised as follows. In Section 2, we give a rigorous 
description of the problems we consider, and discuss examples. In Section 3, 
we then construct our tests, and state our theoretical results. In Section 4, 
we then give empirical results, and in Section 5, proofs. 
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2 Semimartingale detection problems 


We now describe our concept of a semimartingale detection problem. Our 
setting will include volatility goodness-of-fit problems like (1), as well as 
many other semimartingale goodness-of-fit tests. 

We begin with some examples of the problems we will consider. In each 
case, we will describe a semimartingale model with a volatility-like process 
Hf We will wish to test the null hypothesis that is given by some known 
function Xt), for an unknown paramter 6o G 0, and an estimable 

covariate process Xt G M*?; our alternative hypothesis will be that fit is not 
given by /r. 

To test our hypothesis, we will construct -measurable observations 
Yi, and a variance function Under the null, and conditional on T*., 
the observations Yi will have approximate mean and variance fj,{6o,ti, Xt^) 
and XtJ. To estimate these means and variances, we will further 

construct estimates 6 and Xi of the parameters 9q and covariates W; • 

We will then be able to estimate the difference between the observations 
Yi and their means fi, scaled according to their variances cr^; we will reject 
the null hypothesis when the size of these scaled differences are large. In 
Section 3, we describe in detail how we perform such tests, as well as giving 
theoretical results on their performance. 

For now, we proceed with some examples of semimartingale goodness-of- 
ht problems in this form. Let Bt and B'^ be independent Brownian motions, 
X{dx, dt) be an independent Poisson random measure with intensity dx dt, bt 
and b[ be predictable locally-bounded processes, and ft{x) be a predictable 
function with 1A \ft{x)\^ dx locally bounded, for some j3 G [0,1). Further 
define times t' := i/n^. 

We then have the following examples. 

Local volatility We wish to test a model for fit in the process 

dXt = btdt + y/JIl dBt , (2) 

making observations Xt^, i = 0,...,n. We set Xi := Xt^, and estimate 
|JLt^ by the realised volatility (Andersen et ah, 2001; Barndorff-Nielsen 
and Shephard, 2002), 


Yr.= n{Xt^^,-Xuf. 

We then define the variance function := 2/i^. 

Jumps We wish to test a model /i for ^it in the process 

dXt = btdt + y/JTt dBt + ft{x) X{dx, dt), 
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making observations Xt^, i = 0,...,n. We set Xi := Xt^, and estimate 
Ha by the truncated realised volatility (Mancini, 2009; Jacod and Reifi, 
2014), 

^ dni^/^iXt^^l Xi^)')^ Qn{x) X 
for any sequence an > 0 satisfying 

log(n) = o(an), On = o{n'^) for all k > 0. (3) 

We then dehne the variance function := 2fj?. 

Microstructure noise We wish to test a model [i for fit in the process 

dXi^t = btdt + ^/JTt dBt- 

We make observations 

Xi i '■= Xi t', + Si, i = 0,..., n^, 

where the noises et are measurable in the filtrations ■= ns>t' 
and satisfy 


IE[ei I -Ft'] = 0, 

E[4 I J-,,] = 

E[|er I -^t'] < 

for an Itb semimartingale X 2 ^t with locally-bounded characteristics, 
and constants k > 8, C > 0. We estimate Xf. and fitj by their pre¬ 
averaged counterparts (Jacod et ah, 2009; Reifi, 2011), 

Xij := n Yli =0 

X2,j := (2n)-^ E"=oiXl,nj+^+l - ^l,ni+^)^ 

Yj := vr^(2n“^(X]r=o^ cos(7r(i -h l)/n)Xi^nj+i)‘^ - ^2,i)- 

We then define the variance function := 2{fi + 7r^Jh2,t)^. 

Stochastic volatility We wish to test a model fi for fit in the processes 

dXi^t = btdt + \fX^t dBt, 
dX2,t = b'tdt + y/JTt dB't, 

making observations Xi t '., i = 0,..., n^. We define volatility estimates 
X2,i ■■= n^{Xi^tr^^ - Xi^trf, i = r? - I, 
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which we use to estimate Xt- and fit- (Barndorff-Nielsen and Veraart, 
2009; Vetter, 2012), 



We then dehne the variance function := 2(/r + 27r^V|^)^. 

Others Many other models, for example including covolatility or leverage, 
or combining any of the above features, can be described similarly. 
For simplicity, we assume in the following that the times ti are deter¬ 
ministic and uniform; however, models with uneven or random times 
that are suitably dense and predictable can be addressed in a similar 
fashion. 

To concisely describe these examples, and others, we will state a set of 
assumptions on the observations Yi, mean and variance functions /r and ci^, 
parameters 9, covariates Xt, and estimates Xi. It will be possible to show 
that the above models all he within our assumptions, and we may thus work 
within these assumptions with some generality. 

To begin, we define some notation. Let || • || denote any finite-dimensional 
vector norm; write a = 0{b) if ||a|| < CH^H, for some universal constant C; 
and write a = Op(b) if for each e > 0, the random variables a and b satisfy 
P(||a|| > Ce||6||) < e, for universal constants Cg. 

We stress here that the implied constants C and are universal; in 
statements such as o = 0(1), we require the supremum sup ||a|| over all such 
a to be bounded. Given a function / : V —>■ M, we also define the supremum 
norm ||/||^ := sup,,gjY|/(x)|. 

Our assumptions are then as follows. 

Assumption 1. Let (Q,T', (Ti)ig[o,i])IP) be a filtered probability spaee, with 
adapted unobserved mean, variance and eovariate processes pLt E K, cr| > 0, 
and Xt E respectively. For 0<t<t + h<l, letting Wt denote either 
of the proeesses pit or Xt, we have 


Wt = 0(1), 
E[Wt+h - Wt I Ft] = 0(h), 
n\\Wt+h - I Ft] = 0(h). 


( 4 ) 


For i = 0, ...,n — 1, we have Fti_^_i-measurable estimates Xt of Xt^, 


satisfying 


E[\\X,-Xt^f\Fu] = 0(n-^), 
E[\\X,-Xt^f\Ft,] = 0(n-^). 


( 5 ) 
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We also have J^n+i-iTT-Go^surable observations Yi, satisfying 


E[y, I J-iJ = Mi,+0(n-V2), 

Var[Fi| = +0(n-i/4), (6) 

I J^u] = 0(1), 


for a constant e > 0. 

Under the null hypothesis Hq, we suppose our observations Yi are de¬ 
scribed by a parametric model, 

Ht = ii{eo,t,Xt), af = a‘^{eo,t,Xt), 

for known functions fi, : Q x [0,1] x —>■ M, and an unknown param¬ 

eter 00 £ Q. We suppose that Q C MP is closed, and is positive. We 
also suppose the functions p. and cr^ are locally Lipschitz in 6, continuously 
differentiable in t, and twice continuously differentiable in X. Finally, we 
suppose we have a good estimate 9 of Oq, satisfying 

0-0o = Op(n-O2). 

Under the alternative hypothesis Hi, we instead allow pt, o't unrestricted, 
and require only that 6 = Op(l). 

To ensure the examples given above lie within Assumption 1, we must 
require that the parameter space 0 C be closed, and the model function 
p be locally Lipschitz in 9, continuously differentiable in t, and twice contin¬ 
uously differentiable in Xt. These conditions should all be satisfied for most 
common models. 

We must further require the semimartingales Xt to be bounded, and 
have bounded characteristics. In general, this assumption may not hold 
directly; however, we can assume it without loss of generality using standard 
localisation arguments. 

In Section 5.3, we then check that the above examples satisfy our con¬ 
ditions on the processes pt, at, and Xt] estimates Xp and observations Tj. 
Most of these conditions follow from standard results on stochastic processes; 
where necessary, higher-moment bounds can be proved using our Lemma 1 
below. 

To satisfy Assumption 1, it remains to choose an estimate 9 of 9 q, having 
error under Ho, and being Op(l) under Hi. While our results are 

agnostic as to the choice of 0, a simple choice is given by the least-squares 
estimate 

9 := argmin^ee Ynloi^i “ Xi))"^, (7) 

which can be found by numerical optimisation. Under standard regularity 
assumptions for nonlinear regression, this estimate 9 can be shown to satisfy 


our conditions, arguing for example as in Section 5 of Vetter and Dette 

( 2012 ). 

Finally, we note that in the microstructure noise and stochastic volatility 
models, we need to make -n? + 1 observations of the underlying process Xt to 
construct the n estimates Yi. We may thus expect to achieve the square-root 
of any convergence rates given below; such behaviour, however, is common 
to all approaches to these problems in the literature. 

We have thus shown that many different semimartingale goodness-of-fit 
problems can be described by our Assumption 1. Next, we will describe our 
solutions to these problems. 

3 Wavelet detection tests 

To state our tests for the problems given by Assumption 1, we first consider 
the signal function 


St{e) := {^it- ^^{e,t,Xt))/a{e,t,Xt). 

This function measures the distance of the model mean fi from the true 
mean weighted by the model variance cr^. Under Hq, we have 

sm ~ SM = 0 , 

while under Hi, we can in general expect |S't(0)| to be large. We may thus 
reject Hq whenever an estimate of St{0) is significantly different from zero. 

To estimate the signal St{0), we will use wavelet methods. Let (/? and V’ 
be the Haar scaling function and wavelet, 

V’:= 1 [0,1/2) - 1 [1/2,1), 

and for j = 0,1,..., k = 0,... ,2^ — 1, define the Haar basis functions 

(Pj,k{t) := 2^^‘^(p{2H - k), ■■= 2^l'^'t\){2H - k). 

We can then describe St{0) in terms of its scaling and wavelet coefficients 

<^3,k{0) '■= fo ^j,k{t)St{9) dt, Pj,k{0) ■= fo '^j,k(i)St(9) dt. 

To estimate these coefficients, we first pick a resolution level J G No, so 
that 2*^ is of order We then estimate the scaling coefficients C(j^k{9) by 

aj,k{0) ■= ^J,k{ti)Zi{6), 

where the normalised observations 


Z,{9) := {Yi-fi{9,ti,X,))/a{9,ti,Xi). 
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We note that for fixed 9, these estimates can be computed in linear time, as 
each observation Yi contributes to only one coefficient 3j^fc(0). 

To estimate the coefficients ao,o(^) and /3j^k{9), 0 < j < J, we then 
perform a fast wavelet transform, obtaining estimates 

«o,o(6') := E/ fd '■= fd 

We note that efficient implementations of this transformation, running in 
linear time, are widely available. 

To test our hypotheses, we will take the maximum size of these estimated 
coefficients, producing test statistics 

f{e) := maxo<j<j,A: 130,0(0)1, 

We will show that under Ho, T{6) is asymptotically Gumbel distributed, 
while under Hi, T{9) will tend to be greater. 

Theorem 1. Let Assumption 1 hold. 

(i) Under Hq, 

a-}{n^/^f(9) -b2j)AG 
uniformly, where the constants 

am ■= (21og(m))"^/^, 

bm ■■= - ^amlog(7rlog(m)), 

and G denotes the standard Gumbel distribution. 

(a) Under Hi, 

f(e)-T{e) = Oj,{n-^/Hog{nf/^) 

uniformly, where 

T{9) ;= maxo<j<j,fc |ao,o(0)|, I3j,fc(0)l- 

We thus obtain that under Hq, T{9) concentrates around zero at a rate 
log(n)^/^. Under Hi, it concentrates at the same rate around the quan¬ 
tity T{6), which measures the size of the signal St{6). We can use this result 
to construct tests of our hypotheses, and prove bounds on their performance; 
we first note that for some of our bounds, we will require the following as¬ 
sumption. 

Assumption 2. The processes pLt and Xt are ltd semimartingales, 

Tt = Sdibs ds + {cffY' dBs + /r fs{x) \{dx, ds)), 

= Jo + /r A(dx, d.s)). 
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for a Brownian motion Bg € independent Poisson random measure 

X{dx,ds) having eompensator dxds, predietable processes bg, bf^, Cg, = 
0(1), and predictable funetions fg{x), f^s{x) satisfying 1 A \ fs{x)\dx = 
0 ( 1 ). 

Under Assumption 2, we thus have that and Xt are Ito semimartin¬ 
gales, with bounded characteristics and finite-variation jumps. This assump¬ 
tion holds for many common financial models, if necessary after a suitable 
localisation step. Using this condition, we are now ready to describe our 
tests, and bound their performance. 

Theorem 2. Let Assumption 1 hold, and for a E (0,1), define the Gumbel 
quantile 

qn,a ■= -a 2 J log(-log(l - «)) 62^5 

and eritieal region 

Cn,a ■= ( 0 ) > qn,a}- 

(i) Under Hq, we have IP[0„_q,] —>■ a uniformly. 

(a) Under Hi, let > 0 be a fixed sequence with Mn —oo. If is one 
of the events: 

(a) {||5'(0)||^ > M„n"^/^log(n)^/ 2 }^ given also Assumption 2; or 

(b) {maxo<j<j,fc 2^/‘^\J^_.l:^^^^St(0)dt\ > log(n)^/ 2 }. 

we have ^{En \ Cn,o\ —^ 0 uniformly. 

We thus obtain that the test which rejects Hq on the event Cn,a is of 
asymptotic size a, and under Assumption 2 , can detect signals St{9) at the 
rate log(n)^/^ in supremum norm. We further have that, even without 

Assumption 2, our test can detect a signal whenever the size of its mean over 
a dyadic interval is large. 

In particular, if St{6) oc et for some non-zero deterministic process e*, 
then et must have non-zero integral over some dyadic interval 2~^[k, A: -|- 1 ). 
We deduce that our test can detect signals in the fixed direction et at the 
rate log(n)^/^, without prior knowledge of et. 

We can further show that these detection rates are near-optimal. 

Theorem 3. Let Assumption 1 hold, and 6n > 0 be a fixed sequenee with 
5n —^ 0 . If En is one of the events: 

(i) {||S(e)IL given also Assumption 2; or 

(a) {maxfc 2 '?-/ 2 | St{6) dt\ > for some = 0 ,..., J; 
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then no sequence of critical regions Cn can satisfy 


limsup^ P[C'„] < 1 


uniformly over Hq, and 


uniformly over Hi. 


FlEn \ ^ 0 


We thus conclude that our goodness-of-fit tests achieve the near-optimal 
detection rate of log(n)^/^ against general nonparametric alternatives, 
in a wide variety of semimartingale models. This result is already a signifi¬ 
cant improvement over previous work; we note that similar methods do not 
establish near-optimality for the procedures of Dette and von Lieres und 
Wilkau (2003), for example, where the corresponding lower bound would be 

Furthermore, we have shown that our method simultaneously provides 
near-optimal detection rates against alternatives which are easier to detect, 
including the case where the signal St{0) lies in a fixed direction et. We may 
thus achieve good detection rates in a fully nonparametric setting, without 
sacrificing performance against fixed alternatives. 


4 Finite-sample tests 

We next consider the empirical performance of our tests. As convergence 
to the Gumbel distribution can be quite slow, in the following, we will 
consider a bootstrap version of our tests, which will be more accurate in 
finite samples. 

The general procedure is as follows. First, we estimate the parameters 
9 from the data, using some estimate 6. Next, we simulate many sets of 
observations from the null hypothesis, with parameters chosen by 9. 
Any components of the null hypothesis not described by 9, such as drift or 
jump processes, are set to zero. 

For each set of simulated observations Yf f we then compute a parameter 
estimate 9^^\ and statistic Finally, we reject the null hypothesis if 

the original statistic T{9) is larger than the (1 — aj-quantile of the simulated 
statistics 

We now perform some simple Monte Carlo experiments on these tests. 
We will compare our tests to those of Dette and von Lieres und Wilkau 
(2003), Dette et al. (2006) and Dette and Podolskij (2008), using the same 
methodology as Dette and Podolskij. As in that paper, we will generate 
Monte Carlo observations in the local volatility setting (2). We will then 
use our tests to evaluate the goodness-of-fit of various parametric models 
for the volatility. 
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In each case, we consider receiving n = 100, 200 or 500 observations, 
and constructing confidence tests at the a = 5% or 10% level. We then 
generate 1,000 realisations of simulated data, compare our statistic against 
1,000 bootstrap samples in each realisation, and report the proportion of 
runs in which the null hypothesis is rejected. 

In our tests, we set the resolution level J := [log2(n)/2j, and use the 
least-squares parameter estimates Q given by (7). As the models we consider 
will be linear in the parameters 0, we will be able to compute these estimates 
in closed form, as linear regressions. 

Table 1 then gives the observed rejection probabilities of our tests in two 
models: a constant volatility model, where fi{x, t, 6) = 9; and a proportional 
volatility model, where 9) = 9x^. In each case, we give results for our 

tests under a variety of null and alternative hypotheses. 

We note the hypotheses tested are the same as in Tables 1-4 of Dette 
and Podolskij (2008), as well as Table 3 of Dette and von Lieres und Wilkau 
(2003), and Tables 3.1 and 3.4 of Dette et al. (2006). We may thus directly 
compare the performance of our tests to those given in previous work. 

We find that in both models, our tests have good coverage under the null 
hypothesis, and reliably reject under the alternative hypothesis. The power 
of our tests is competitive with previous work under the constant volatility 
model, and generally improves upon previous work under the proportional 
volatility model. 

We conclude that our tests not only achieve good theoretical detection 
rates, but also provide strong finite-sample performance. They may thus 
be recommended for many different goodness-of-fit problems, whether pre¬ 
viously discussed in the literature, or newly described by our more general 
assumptions. 

5 Proofs 

We now give proofs of our results. In Section 5.1 we will state some technical 
results, in Section 5.2 give our main proofs, and in Section 5.3 prove our 
technical results. 

5.1 Technical results 

We first state the technical results we will require. Our main technical result 
will be a central limit theorem for martingale difference sequences, bounding 
the exponential moments of the distance from Gaussian. 

Lemma 1. Let (D,%', (%'j)”^g,P) he a filtered probability space, and let Xi, 
z = 0,... , n — 1, be X-i+i-measurable real random variables. Suppose that for 
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n 

a 

100 

200 

500 

5% 

10 % 

5% 

10 % 

5% 

10 % 

Constant volatility, null, = 1 





bt = 0 

0.048 

0.105 

0.056 

0.101 

0.035 

0.089 

bt = 2 

0.055 

0.114 

0.057 

0.103 

0.044 

0.084 

bt = Xt 

0.056 

0.101 

0.041 

0.093 

0.037 

0.092 

bt = 2-Xt 

0.048 

0.095 

0.052 

0.105 

0.051 

0.100 

bt = tXt 

0.038 

0.094 

0.060 

0.101 

0.063 

0.111 

Constant volatility, alternative. 

bt = X 

t 




y/Jh — ^ + Xt 

0.777 

0.840 

0.898 

0.932 

0.976 

0.985 

= 1 + sin 5Xt 

0.964 

0.977 

0.997 

0.999 

1.000 

1.000 

y/Jh = 1 + Xt exp t 

0.954 

0.975 

0.987 

0.994 

0.999 

0.999 

y/Jh = 1 + Xt sin bt 

0.851 

0.908 

0.970 

0.982 

0.994 

0.995 

y/Jh = 1 + tXt 

0.742 

0.796 

0.883 

0.914 

0.951 

0.972 

Proportional volatility. 

null, pt 

= xi 





II 

o 

0.062 

0.119 

0.044 

0.090 

0.043 

0.087 

bt = 2 

0.073 

0.120 

0.056 

0.106 

0.043 

0.081 

II 

0.070 

0.115 

0.055 

0.100 

0.043 

0.098 

bt = 2-Xt 

0.053 

0.085 

0.055 

0.100 

0.034 

0.081 

II 

0.070 

0.106 

0.062 

0.123 

0.045 

0.106 

Proportional volatility. 

alternative, bt = 

= 2-Xt 




Pt = l + Xi 

0.602 

0.673 

0.700 

0.766 

0.844 

0.884 

Pt = ^ 

0.832 

0.871 

0.927 

0.951 

0.979 

0.991 

Pit = ^Xtf^ 

0.580 

0.669 

0.672 

0.760 

0.854 

0.902 

Pt = 5 Xt 

0.896 

0.932 

0.963 

0.974 

0.995 

0.998 

pt = {l + Xtf 

0.831 

0.878 

0.894 

0.929 

0.964 

0.979 


Table 1: Observed rejection probabilities for bootstrap test. 
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some K>1, 


I ^i] = 0, 

(i) If also 

E[IEr=o E[X2 I - IP" I -^o] = 0{n-), 

then on a suitably-extended probability space, we have real random vari¬ 
ables r] and M, independent of F given Fn, such that 

Er=o^* = e + r/; 

is standard Gaussian given we have 

I J-o] = 0(n-"); 

for ti G R, 

E[exp(u7? — ^u^M) I J'o] < 1; 

and M > 0 satisfies 


E[m 2^ I Jb] = 0{n-'^). (8) 

(a) For random variables Ci = 0(1), let Vc ■= Y17=o Then on a 

suitably-extended probability space, we have a constant A = 0(1) and 
real random variable M, independent of F given Fn, such that 

suPcE[|uc|‘^'" I -^o] = 0(1); 


for u GR, 

supc E[exp(ttT;c — ^u‘^{A + M)) | J^)] < 1; 
and M >0 satisfies (8). 

We will also need the following result on combining exponential moment 
bounds. 

Lemma 2. Let {Il,F,R) be a probability space, with real random variables 
{Xi)'fFQ and M. Suppose that for u G R, 

E[exp(uW - iu^M)] = 0(1), 

and M = Op{rn) for some rate > 0. Then 

maxj \Xi\ = Op(ry^log(n)^/2). 
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Our next technical result will bound the moments of our observations Yi, 
and their normalisations Zi{9). The result will be stated using the Holder 
spaces C®, defined as follows. Given a function / : X —)• M, for suitable 
X C we define the 1-Holder norm 


ll/llci •= ll/lloo V sup^^y^x\f{x) - f{y)\/\\x - y||, 
and the 2-Hdlder norm 

^ V max^^^||(V/)i||^i, / is differentiable, 


C2 — \ 

I oo, otherwise. 

We also say / is C® if ||/||(^s < oo. 

Lemma 3. Under Hq or Hi, suppose the Xi = 0(1), and 0 is bounded. 

(i) For fixed i and Yi, the variables ZfiO) are functions of 9 and Xt, 
with Holder norm 0(1 -|- |li|). 

(ii) The variables St{9) are funetions of 9, t, fit and Xt, and for fixed 
9 and t, also functions of fit and Xt, both with Holder norm 0(1). 


(Hi) For 9 G Q, we have 


nZfi9)\Ft,] = Stfi9) + Oin-^/‘^), 


\Zi 


i+e 


= 0 ( 1 ), 


and under Hq, also 

E[Z,(0o)"| J)J = l + 0(n-i/^). 


(iv) Define times 


Sk ■= \n2 '^k'\/n. A: = 0,..., 2'^. (9) 

Then 

maxfc n-^/2 Y2=ns\~^ = Op(l). 

Finally, we will need a result controlling the behaviour of the processes 
St{9) under Assumption 2. 


Lemma 4. Under Hi, suppose 0 is bounded, let 0„ C Q be a sequence 
of finite sets, of size 0(n'^) for some k > 0, and let 5n = 0(n“^/^). Given 
Assumption 2, we have 


St{9) = St{9) + St{9), 


where the proeesses St{9) and St{9) are as follows. 
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(i) We have 

s^V9&e^,\s-t\<sJSs{0) - St{B)\ = Op(n-i/'^log(n)i/2^. 

(ii) In -L^([0,1]), let Pjf denote the orthogonal projeetion of f onto the 
subspace spanned by the sealing funetions and define the remain¬ 
der Rjf ■.= f - Pjf. Then 

sup0e0j|i2j5(6»)|U = Op(n"i/Mog(n)^/2)_ 

(Hi) We have a random variable G N, and random times 0 = tq < • • • < 
tn = 1, such that the proeesses St{0), 6 G 0„, are constant on intervals 
[n,n+i), [tat-i,ttv], and 

P[minj(rj+i - n) < 6n] ^ 0. 


5.2 Main proofs 

We may now proceed with our main proofs. We first prove Theorem 1, 
beginning with a lemma controlling the variance of our estimated scaling 
coefficients aj^k{B)- 

Lemma 5. For k = 0,... ,2'^ — 1, 0 G 0, define scaling-coefficient variance 
terms 

ajMO) ■■= n-^ Er=o TJ,k{ti)iZiiO) - E[^i(0) | 

(i) Under Hq, suppose the Xi = 0(1). Then on a suitably-extended prob¬ 
ability space, we have a filtration {Qk)k=o, o.nd Qk+i-measurable real 
random variables f,k, hk, Mk, such that 

= fk + Vk] 

the variables f,k o,re standard Gaussian given Qk] 

E[exp(n% - \u^Mk) \ Qk] < 1; 
and the variables Mk > 0 satisfy 

I Gk] = 0(n-(^/2+£/8))_ 

(ii) Under Hi, suppose 0 is bounded, and the Xi = Xt^. We then have 
constants Ak = 0(1), and on a suitably-extended probability spaee, a 
filtration {Gk)k=Q CLnd real random variables Mk, such that 

snpQ^QW\eyiY>{un^/‘^aj^k{0) - \u^{Ak + Mk)) \ Gk] < 1; 

the variables Mk > 0 satisfy (10); and the aj^kfi) and Mk are Gk-\-i- 
measurable. 
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Proof. We first prove part (i), and argue by induction on k. Let Go = Tq, 
and suppose that for i = 0,..., A: — 1 we have constructed, on an extended 
probability space, cr-algebras Gi+i, and random variables f,i,rji, Mi satisfying 
our conditions. We suppose also that Gk has been chosen to be independent 
of F given where the times Sk are given by (9); we note this condition 
is trivially satisfied for Go- 
We can then write 


= E“sr' c.. 

where the m := n(sfc+i — Sk) summands 

0 := n-^/H^/\Zi{eo) - nZi{0o) \ 

To compute the moments of the Q, we may apply Lemma 3(iii), noting that 
since we are only interested in 0 = 0O) we may assume 0 is bounded. We 
thus have 

lE[Ci I J^ti,Gk] = 0, 

Er=n:r' i ^u^<3k] = i+ 

using also that the Q are independent of Gk given Ft^. 

We may therefore apply Lemma l(i) to the variables n^^‘^aj^k{(^o)- On a 
further-extended probability space, we obtain random variables ^ki Vk: ^k 
satisfying the conditions of part (i), independent of F given Gk and 
Defining Gk+i to be the u-algebra generated by Gk^ -^sfc+u % and 
we deduce that Gk+i satisfies the conditions of our inductive hypothesis. By 
induction, we conclude that part (i) of our result holds. 

To prove part (ii), we argue similarly, noting that the random variables 

where the JD...-measurable summands 
*'1+1 

Ci:= n-^lH^I\Y,-W.\Yi\Fu]), 
and the -measurable coefficients 


Ci{9) := l/a{e,ti,Xtf). 


As the function a is continuous and positive, and 6 and Xt are bounded, we 
have the variables Ci{9) = 0(1). We may thus apply Lemma l(ii), producing 
random variables A^, satisfying the conditions of part (ii). The result 
then follows as before. □ 
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We now prove a lemma bounding the variance of our estimated scaling 
and wavelet coefficients ao,o(^)) Pj,k{G)- 

Lemma 6. Suppose the Xi = 0(1), and for j = 0,..., J—1, k = 0,... ,2^ — 1 
and 6 G &, define the wavelet-coefficient variance terms 

hk{0) := n-i Er=o - nz^{e) I Xtfi). 

Similarly define scaling-coefficient variance terms 5o,o(^) using (/?o,o- 

(i) Under Hq, suppose 9 — 9 q = 0(n“^0). Then on a suitably-extended 
probability space, we have real random variables f,j^k,fij,k,Vj,k such that 

= ^-1,0 + + 'y-1,0; 

^ ^ fii,kifi') ij,k S' Pj^k “1“ ^j,ki 

the fj^k o,re independent standard Gaussian; and for some e' > 0, 
maxj^k\Vj,k\ = Op{n~^'), max^^fc 2^/'^\vj^k\ = Op(l). 

(a) Under Hi, suppose 0 is bounded. Then 

suPj-fc,(?ge|5o,o(6')|, \fij,k{9)\ = Op{n-^/‘^log{ny/‘^). 

Proof. We will consider the wavelet-coefficient variance terms fij^k{9)] we 
note we may include scaling-coefficient variance terms 5o,o(^) similarly. To 
prove part (i), we then apply Lemma 5(i). We obtain a filtration Qi, and 
variables Mi, and pi as in the statement of the lemma. Since 

Pj,k{^) = 'Eibj,k,iaj,i{9), 

where the coefficients 

hj,k,i •= /o 4’j,kTJ,l, 

we have 

^ ^ l^j,k(,9) — f,j,k T 9j,k T '^j,ki 

for terms 

f,j,k ■ ^^lhj,k,lf,h 9j,k ■ ^^lhj,k,l'nii 

and ^ ^ 

Vj,k ■= n^^‘^{/3j,k{0) - I3j,k{0o)). 

We first describe the terms fj^k- Since the are jointly centred Gaussian, 
so are the f,j^k- Furthermore, we have 

= Y,i bj,k,ibj'^k',i 

= foCEi hj,k,iTj,i){Y.v hj',k',vTj,v) 

~ fo '^j,k'fij',k' 

~ ^{j,k)={j',k’)- ( 11 ) 
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We deduce that the are independent standard Gaussian. 
We next bound the rjj^k. Setting 


M := max; Mi , 


we have that 

IE[M2+^/2] < = C>(n-=/8), 

so M = Op{n~^') for some e' > 0. Using (11), we also have 

E[exp{urjj^k - < £[[]; exp(u6j-^ 

< 1 . 


The desired result follows by applying Lemma 2. 

Finally, we control the Vj^k- Since we are only interested in 0 = we 
may assume 0 is bounded. For 9,9' E 0, |0 — 9'\ = 0(n“^/^), we then have 

SUPj,^(0) - fc(0')l 

= maxj,fcO(n“^/22-?'/2) I]r=o^lV’i,fc(L)|(1 + |Ti|), 


using Lemma 3(i), 

= 0(n-'''^)(l + maxtn-‘''2X)Sir'|ril) 

= 0(n-‘/")(l + (maxj 


by Cauchy-Schwarz, 


= Op(n-i/2), (12) 

using Lemma 3(iv). We deduce that 

supj-fc2^/2|£;^.^| ^ Op(l). 

To prove part (ii), we first claim we may assume the W = Xt^. To prove 
the claim, we define terms 


Z'(0) := (Yi-^i{9,ti,Xt,))/ai9,ti,Xt,), 

and 

\ x^]). 

We then have 


supi,fc,eeel4fc(0) 

= 0(n“^)maXj-fcX;”r(||V^j-fc(L)|(l + |Ti|)||W - X*. 
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using Lemma 3(i) 


= 0(n ' E?.o‘ 

by Cauchy-Schwarz, 

= Op(n-V2)(i + max, n-V2 y^2)i/2^ 

since EErJo 11^* " ] = 0(1), 

= Op(n"^/2), 
using Lemma 3(iv). 

We may thus assume the W = Xt^, and so apply Lemma 5(ii). On an 
extended probability space, we obtain a filtration Qi, constants Ai = 0(1), 
and variables Mi as in the statement of the lemma. Setting 


M := max;(^; + Mi), 
we obtain that M = Op(l), and 

sup5)geE[exp(Mn^/^,0j,fc(6') - ^u^M)] < 1, 
arguing as in part (i). Letting denote a n“^/^-net for 0 C M^, of size 

0(nP/2), 

we thus have 

T^axj^k,e&eJPj,k{0)\ = 0p(n"02 iog(^p/2)i/2^ 

= 0p(n"^/2 log(n)02)^ 

using Lemma 2. 

Next, for any 0 G 0, we have a point 9 G 0„ with 9 — 9 = 0(n“^/^). 
Using (12), we deduce that 

supj,fc,e6el^i,fc(6') - Pj,ki9)\ = Op{n-^/^). 

We conclude that 

supj,fc,0eel^j,fc(6')| = Op(n-02 iog(^)i/2)_ □ 

Next, we prove a lemma bounding the bias of our estimated scaling and 
wavelet coefficients ao,o(^), I3j,k{9)- 

Lemma 7. Suppose the Xi = 0(1), and for j = 0,..., J—1, /c = 0,..., 2'^ —1 
and 0 G 0, define the wavelet-coefficient bias terms 

■■= n-^ Er=o V',,A:(U)E[Z,(0) I J-iJ - 4,(0), 

Similarly define scaling-coefficient bias terms aofi{9) using ipofl- 
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(i) Under Ho, suppose 9 — Oq = 0{n Then 

maXj-fc|ao,o(^)|, = Op(n"^/^). 

(ii) Under Hi, suppose 0 is bounded. Then 

supj,fc,0e0|ao,o(6')|, \^j,k{Q)\ = Op{n-^l‘^). 

Proof. We will bound the wavelet-coefficient bias terms /3j^j^{9); we note we 
may include the scaling-coefficient bias terms ao,o(^) similarly. For t G [0,1], 
define t := [nt\/n, and set 

■■=fof’i,k(t)(St(9)-Si(9))dt. 

In each part (i) and (ii), we will show that ^,(0) is close to /3 (0), which 

is small. 

We note that in either part we may assume 0 is bounded, since in 
part (i), we are only interested m. 9 = 9o, 9. We then have 

^ rl-^Tri=o\'^3,k{ti)mz^{e) I :Fu]-Su{e)\ 

+ /oVj.fcW - i’j,ki't)\\St{9)\ dt 

= (13) 

using Lemmas 3(ii) and (iii). It thus remains to bound the fi {9). 

To prove part (i), we note that 

where the JF.,,-measurable summands 
*'1+1 

ObT := - f^j,k{t){St{9o) - SuiOo)) dt. 

Using Lemma 3(ii) and Taylor’s theorem, we also have that 

st{9o) - Sum = mt - m+djm - wj 

T P \\Xt — XuW'^ + n ^), 

for bounded T).-measurable random variables Ci G M, dj G 
We deduce that 

E[Ci,i,A: I m = 

and similarly 

Var[0,,,fc I m < E[C2^. , I m = 0{nm^). 
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Furthermore, for fixed j and k, we have that all but 0{n2 of the Ci,j,k are 
almost-surely zero. We thus have 

= 0(n-2). 

We deduce that 

E[maxj-fc^^.^(0o)^] < Ej,fe fc(^o)^] 

<0{n-^)E,2^ 

= 0(n-3/2^, 

so maxwfcl/3., (0o)| = We also have 

using Lemma 3(ii). We conclude that 

+ maxj,fc 2^/‘^\p.^^{9o) - §_j ,.(0)\ 

+ maxj-fc 2^/‘^\p.^^{9) - Pj^kW] 

= Op(n"^/2), 

using (13). 

To prove part (ii), using Lemma 3(ii), we have 

St{9) - Sti9) = 0{\^it - /Til + llXi - WII + n-i). 

We deduce that 

SUPj,fc,0G0l-5j-fc('^)l 

= 0{l) sup j^k Jo\'iPj,k{t)\i\^it- fJ-t\ + \\Xt- Xt\\ +n-^)dt 

= 0(l)(sup^-fc/(J 'iplk{t) 

^ iloilk't- k'tf + \\Xt- Xt\\‘^ + n~^) dt)^/‘^, 
by Cauchy-Schwarz, 

= Op(n“^/2), 

since V’|fc(^) dt = I, and 

HloilfJ-t- + \\Xt- XtW'^ + n“2) dt] = 0{n~^). 

Using (13), we conclude that 

supj,fc,0g0l^i,fc(6')| =Opin-^/^). □ 
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We can now prove our limit theorem for the statistic T{9). 

Proof of Theorem 1. We first note that our estimated scaling and wavelet 
coefficients are equivalently given by 

aofl{9) = rT^ J2i=o ‘Po,o(t)^iW, Pj,kiO) = YhZq 'ifj,k{'t)Zi{9). 
We may thus make the variance-bias decomposition 

«o,o(0) — «o,o(0) = ao,o(^) + ao,o(^)) 

Pj,k{G) — fij,k{0) = Pj,k{G) + 

where the terms 00 , 0 , ^ 0 , 0 , f^j,k and are defined by Lemmas 6 and 7. We 
will proceed to bound the distribution of T{9) using these lemmas. 

We begin by showing we may assume the estimated covariates W = 0(1). 
We note that 

E[maxi||Wf] < E[supjWf] + - Xt,f] = 0(1), 

so maxj||W|| = Op(l). For a constant R > 0, define the variables 

^ _f Xi, II < 

1 Xt^ , otherwise. 

Then as i? —)• 00 , the probability that the Xi and Xi agree tends to one, 
uniformly in n. It thus suffices to prove our results replacing the Xi with 
the Xi] equivalently, we may assume the Xi = 0(1). 

We now prove part (i). Since 9 — 9q = Op(re“^/^), we may similarly 
assume 9 — 9q = 0(n“^/^). Let J 2 = and write 

f{9) = max(r(0),r(0)), 


where the terms 


T{9) := maxo<j<j 2 ,fc |So,o(6')|, \Pj,k{0)\, 
f{9) := maxj2<j<j,fc |^j'fc(6')|. 


Under Hq, using Lemmas 6(i) and 7(i), we can then write 
v}I‘^T(9) = maxo<j<j2,fc |Cj,fc| + Op(l), 

n^/^T(0) = maxj2<j<j^fc |fj-fc| -h Op{n-^'), 

for some e' > 0, and independent standard Gaussians 
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By standard Gumbel limits, we also have 


(maxo<j<j2,fc \Cj,k\ - ^ 2 -^ 2 ) G, 

a~}{maxj^<j<j^k \^j,k\ - h-r) G] 

we note that in the second limit, we may use the constants 02 J and b 2 j, 
rather than a 2 j_ 2 J 2 and 62 J- 2-^2 > as the difference is negligible. We deduce 
that 

P[f (0) = f{e)] 1, 

and so 

- b2j) = ( 0 ) - 63 ^) + Op(l) 

= a~}{ma^j^<j<j^k\Cj,k\ - ^2-^) + Op(l) 

4- G. 

Next, we prove part (ii). As before, since 9 = Op(l), we may assume 
9 = 0(1), and hence that 0 is bounded. Using Lemmas 6 (ii) and 7(ii), we 
then have 


f(9) - T{9) = 0(1) maxo<j<j,fc|So,o(^) - ao,o(^)l, |4i,fc(^) - I3j,k{^)\ 

= Op(n"^/^log(n)^/^). 

Finally, we note that the rates of convergence proved depend only upon 
the bounds assumed on the inputs. They therefore hold uniformly over 
models satisfying our assumptions. □ 

Next, we will prove our results on test coverage and detection rates. 

Proof of Theorem 2. We hrst note that part (i) is immediate from Theo¬ 
rem l(i). To prove part (ii), we consider separately the cases (a) and (b). 
In each case, we will prove that with probability tending to one, the event 
En implies 

T{9) > M'^n-G‘^\og{nfG^ 

for a fixed sequence —)• 00 . The result will then follow from Theorem l(ii). 

In case (a), we note that arguing as in Theorem 1, we may assume 0 
is bounded. Let 0„ be an n“^/^-net for 0, of size 0{'nP/^), and 9_ be an 
element of 0 „ satisfying 

d-9 = 0{n-^/^). 


Using Lemma 3(ii), we have 

St{9) = St{l) + 0{n-^/^), 
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so on Em 

l|5(0)IL > ll-5(0)lloo - > M„n-V4 iog(n)V2/2, 

for large n. We may thus assume further that 9 G 0„. 

We then apply Lemma 4, obtaining processes St{6), St{9), and times r,. 
On the event En, for some point u G [0,1], we have 

|*S'„(0)| > M„n“^/'^log(n)^/^. 

We thus have u G [r*, Tj+i) for some i < A^ — 1, or u G [ri, Ti+i] for i = A^ — 1. 
From Lemma 4(iii), with probability tending to one we also have 

Ti+l -Ti> 2^“'^, 

and so there exists a point u G [r, + 2““^, Tj+i — 2“*^], \u — v\ < 2“'^. 

We deduce that with probability tending to one, 

|aO,o(^*)| + Z]/=0 

= |ao,o(0)<Fo,o(?^)| +12o<j<J,k\P3,k0)'4’j,k{v)\ 

writing the projection Pj in terms of the wavelet functions 

> 15 ^ 0)1 - 15 .( 0 ) - Sum - \RjSum 

> 15^0)1 -15,(0) - Sum - \RjSum, 

since St{9) is constant within a distance 2“'^ of u, 

> M„n“^/'^log(n)^'^^/2, 
using Lemmas 4(i) and (ii). We deduce that 

T{e) > max(|Q!o,o(^)|, |/3 j,2-j[2j,;j(^)I : J = 0,..., J - 1), 

> 2-l-'+3V2(|a„„(«)| + («)|) 

> log(n)^/^, 

for a sequence —>■ oo. 

In case (b), on the event E^, we likewise have 

I“0,0(^) I + 2-5^^l/3j,2-J[2i-J"fc„J (^)l 

> iPps.-^ukM 

= »'\fE:t*''s,id)dt\ 

> Mn2PPn~^P log(n)^/^, 

for some jn = 0,... , J and kn- The result then follows as in part (i). □ 


26 


Finally, we can prove our lower bound on detection rates. 

Proof of Theorem 3. In each case (i) and (ii), we will reduce the statement 
to a known testing inequality. We will consider the model 

Yi := - B^) + e,, 

where Bt is an adapted Brownian motion, the independent -measurable 
variables e* are standard Gaussian given 7"*., r E [0,1] is to be defined, and 
in case (i) we set jn ■= J- It can be checked that this model satisfies our 
assumptions. 

Under Hq, we set r := I, so we have mean and variance functions 

^ := 0, CT^ := 1. 

Under Hi, we instead set r := tm, where m := [n(l — . We then have 

St = - Br), 

so in case (i), 

Similarly, in case (ii), 

F[En] > Stdt\ > 5nn-B^] ^ 1. 

It remains to show that no sequence of critical regions Cn can satisfy 
limsup^< 1 under Hq, and P[C,i] —)• 1 under Hi. We note that un¬ 
der Hq, we have Y ~ N{0,I), while under Hi, Y ~ N{0,I -|- 6nY), for a 
covariance matrix 

^k,l = 0 V /\l — m)!inf. 

As S is non-negative definite, and has Probenius norm 0(1), the result 
follows from Lemma 2.1 of Munk and Schmidt-Hieber (2010). □ 

5.3 Technical proofs 

We now give proofs of our technical results, beginning with a demonstration 
that our examples satisfy our assumptions. 

Lemma 8. The examples in Section 2 satisfy the conditions of Assump¬ 
tion 1. 

Proof. As in Section 2, we may assume our conditions on p., a and 6 are 
satisfied. It thus remains to establish the conditions (4)-(6) for each of the 
examples. 
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Local volatility By standard localisation arguments, we may assume con¬ 
dition (4), as well as that bt, = 0(1). Condition (5) is trivial. To 
establish condition (6), we make the decomposition 

hi = {Zi^i + Z2^i + , 

where 



We then have that, for p > 0, 

Zi^i I 

E[\Z2,if \ = 0{n-P/^), 

using Burkholder-Davis-Gundy, and 

\Z3,i\ = 0(n-V2). 

The desired bounds follow using Cauchy-Schwarz. 

Jumps By localisation, we may again assume (4), as well as the bounds 
bt, /u 1 A \ft{x)\^ dx = 0(1). Condition (5) is again trivial. For 
(6), we write 

= 9n{Zl^i + ^2,i + -^3,1 + ^4,i + Z^^i), 

where 


Zi,i 

^=v^i:r 

y/9ti dBt, 

Z2,i 


ht ft{x) X{dx,dt) 

Z3,i 

^=v^i:r 

Jac ft{x) X{dx,dt) 

ZA,i 

^=v^i:r 

{y/JZt ~ y/Thi) dBt, 

Z5,i 

^=v^i:r 

bt dt. 


and 

At := {x E M : |/t(x)| < 
for some sufficiently small h' > 0. 

We then have that, for p = 2,4,16, and some e' > 0, 

^i,i I ~ iV(0,ptJ, 
E[|Z2,/ I WtJ = 0(n-V2-'), 
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by repeated application of Burkholder-Davis-Gundy, 


I J^ti] = 0, 

by Ito’s Lemma, 

nz^A^ I ^u] = o{n-p/A. 

by Burkholder-Davis-Gundy, and 

\z^A = o{n-^'A- 

Letting Zi := Zi^i -|- Z 2 ^i -|- .^ 4 ,i -|- Z^^i, and ¥( := Z^, we deduce that 

E[y/| 

nyl?\^u\ = <yl+^Jil+o{n-^/A. 

E[|y/|® I j-ij = 0(1). 

It thus remains to control the difference between Yi and ¥(. 

We first note that for any p >2, small enough q > 1, and as in (3), 

E[\Z,\nz2^^A = Oin-^A + 0(1)E[|Zi,|^’1^2>„J 
= Oin-^A + Oimzf > aA^A 

using Holder’s inequality, 

= Oin-^A, 

using a Gaussian tail bound, Markov’s inequality, and that an grows 
super-logarithmically. We also have 

P[Z3,i A 0 I J^u] = 0(n-V2-5'), 

using our assumptions on ft{x). As an grows sub-polynomially, the 
required bounds follow also for Y). 

Microstructure noise As before, by localisation we may assume condition 

(4) , as well as the boundedness of characteristic processes. To show 

(5) , we first note that 

Xi,j-Xr,t,=Wij + W2,j, 

where 

Wij ■= n X]i=0 ^i,nj+ij 

tUj := n-' EfcKWji,,, - Xu,)- 
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For p = 2,4, we then have 

n\Wij\nTt^] = Oin-P/^), 
using Lemma l(ii), and 

n\W2,j\^ I -Ft,] = 0(n-P/2), 

using Burkholder-Davis-Gundy. We deduce that 

E[\Xij - \ Tt,] = 0{n-P/^). 

By a similar method, decomposing X 2 J into a sum of nuisance terms 
and martingale difference sequences, the same bound holds also for 

To show (6), we write 

Yj = 2{Zij + Z 2 J + — Tr‘^X2j, 

using integration by parts, where 

Zij := vFt sin(7rn(t - tj))y/jMdBt 

+ Z]r=o^ cos(7r(t + ^)/n)enj+i, 

Z 2 J ■= \/n sin(7rn(t — tj))bt dt, 

Z3,j ■= vr(n"^/2 cos(7r(i + 

— cos(7rn(t — tj))Xi^t dt). 

We then have 

E[2ZIj I Tt,] = lit, + 7^^X2,t, + 0{n-^/^), 
by direct computation, 

E[4ZIj I Ft.] = + 7r2X2,t,)" + 

using Lemma l(i), 

E[|Zi,,-n-FiJ = 0(1), 
using Lemma l(ii), 

\Z2,j\ = 0{n-^/^), 

and for p > 0, 

E[\Z3jf\Ft,] = 0in-P), 
using Lemma l(ii). The desired results follow. 
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Stochastic volatility Using integration by parts, we can make a decom¬ 
position 

Yj = 2{Zij + Z 2 J + Z^j -|- Z4j)‘^ — 27r^X|j, 

where 

Zij := y/n sin(7rn(t — tj))y/JitdB[ 

+ Ya=q cos(7r(i + \)/n)x 

{X2,nj+i — ^[X2,nj+i \ 

^2j ■= \Ai sin(7rn(t — tj))b[ dt 

Z3,j ■■= 7 r(n -^/2 cos(7r(i + \)ln)X 2 ^t'^.^. 

— cos(7rn(t — tj))X2^t dt), 

Z 4 J := 7rn“^/^ I2?=o^ cos(7r(i + ^}/n)x 

(E[X 2 ,nj+i I - X 2 ^t'^^^J. 

The desired results then follow similarly to the microstructure noise 
example. □ 

Next, we give some standard exponential moment bounds on stochastic 
integrals. 

Lemma 9. Let (f2,T', (Ti)jg[o,cxD])IF’) be a filtered probability space, with 
adapted Brownian motion Bt G M, and Poisson random measure X{dx, dt) 
having compensator dx dt. 

(i) Let ct be a predictable process, and define 

pl-.= !lclds. 

pto < 00 almost surely, then for u G M, the stochastic integral 

Wt := JqCs dBs 

satisfies 

E[exp(MlUoo - ^u^pIo) I -^ 0 ] < 1- 

(a) In the setting of part (i), if < -R almost surely, we further have 
E[exp(usupj>o|VU| — u^R) \ To] = 0(1)- 

(Hi) Let t > 0, and /s(x) G M 6e a predictable function, with |/s(a:)| < 1, 

/o /rI/*(^)I dxds = 0{R), 
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for some R G (0,1). Then for u < — log(i?), the stochastic integral 
■= lo {Kdx,ds) - dxds) 

satisfies 

E[exp(usup^g[o,t]|H/'s|) I J'o] = 0(1)- 

Proof. We begin with part (i). By localisation and martingale convergence, 
we have 

Wt Woo. 

The Doleans-Dade exponential 

8 {uW)t ■■= exp(rtlW - pi) 

is a non-negative local martingale, so a supermartingale. Hence by Fatou’s 
lemma, 

E[T(uW)oo I .To] < liminfi^ooE[£:(nW)t | To] < E[T(uW)o | To] = 1. 

To show part (ii), we have 

E[T(ttW)^ I To] < exp(tt^T)E[T(2uW)oo | To] < exp(n^ii), 

so £{uW)t is a true martingale. We deduce that 

E[exp(nsup^>o|lW| —vfR) \ To] 

< exp(-iri^T)E[sup 4 >oT(nW)t -t-supi>oT(-ttW)t | To] 

= 0(l)exp(-in2T)E[T(nW)L + T(-«W)L | 

using Doob’s martingale inequality, 

= 0 ( 1 ). 

We now prove part (iii), noting we may assume u > 0. Defining the 
variation martingale 

•= /o/ rI/s(^)I {Kdx,ds) - dxds), 

we then have 

sup^e[o.i]|l^^l < Mt + 0{R), 

so it suffices to bound Mt- Let Mf denote the continuous part of Mt, and 
AMt its jump at time t. Then for v > 0, the Doleans-Dade exponential 

£{vM)t ■■= exp(uMf) ns<t(l + vAMs) 

is a non-negative local martingale, so a supermartingale. 
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Furthermore, since uAMg G [0, — log(i?)], we have 
exp(uAMs) < 1 + cuAMs, 

where the constant c := (1 — R~^)/ log{R). We deduce that 

E[exp(MMt) I J^o] = 0(l)E[exp((c — l)uMf + uMt) \ J^o] 

= 0(l)E[exp(cuMf + u Y.s<t ^Ms) \ To] 

= 0{l)E[£{cuM)t I J-q] 

= 0{l)£{cuM)o 

= 0 ( 1 ). □ 

We may now prove our central limit theorem for martingale differences. 
Our argument uses a Skorokhod embedding, as in Mykland (1995) or Obloj 
(2004), for example. 

Proof of Lemma 1 . We begin with a Skorokhod embedding, allowing us to 
consider the variables Xi as stopped Brownian motions on an extended 
probability space. Our argument proceeds by induction on a variable k = 

0, ...,n. 

We claim that for i = 0,..., A: — 1, on an extended probability space, 
we can construct processes {Bi^t)te[o,oo)j which are Brownian motions given 
the cr-algebra Ri generated by R and Bq, ..., and are independent 

of R given Ri+i. We further claim we can construct variables r, G [0,oo), 
which are stopping times in the natural filtrations of the Bi^f, so that 
Xi = Bi^Ti- 

For k = 0, the claim is trivial; we will show that if the claim holds for 
k, it holds also for A; + 1. By Skorokhod embedding, on a further-extended 
probability space, we can construct a process B^^t which is a Brownian 
motion given R]^, and a variable R which is a stopping time in the natural 
filtration of Bk^t, such that the variable X^ ■= is distributed as X^ 

given Rk- 

Since the stopped process (.Bfc,tA?fc)te[o,oo) is continuous and eventually 
constant, the pair ((Bfc^j/\;^r^)jg[o,oo)) ^fc) takes values in a Polish space. We can 
thus define the regular conditional distribution Qk{x) of ((.Bfc^j/\;^^)^g[o,oo)) Pt) 
given Xk = x and R^- On a further-extended probability space, we can then 
generate a pair {{Bk,tATk)t&[o,co),Tk) with distribution Qk{Xk) given Rk and 
Xk, independent of R given Rk+i- 

We deduce that the triplet i{Bk^tATk)t€[o,oo)Xk, Xk) is distributed as the 
triplet i{Bk^tATk)te[ 0 ,oo)Xk, Xk) given Rk, and hence Bk^tAr^ and Tk satisfy 
the conditions of our claim. It remains to define Bk^t for t > Tk', we set 

Bk,t+Tt; ■ T-fc 4“ Bf^ ^, A ^ 0, 
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for an independent Brownian motion We then conclude that and 
Tfc satisfy the conditions of our claim; by induction, the claim thus holds for 
k = n. 

Next, we will show we can realise the sums ciXi as integrals against 

a common Brownian motion. Define a process 

Bt — !,£)•! 

where the variables 

r(j,t) := 0 V (t- Ei=oB)- 

We will show that Bt is a Brownian motion with respect to a suitable filtra¬ 
tion Gt, and that the sums CiXi can be written as stochastic integrals 

against Bt. 

For fixed j = 0,..., n — 1, the cr-algebras 

form a filtration in t > 0, and the variables T{j,t) are ^j^t-stopping times. 
For fixed t > 0, we can thus define the cr-algebras Gj,T{j,t)i which form a 
filtration in j = 0,..., n — 1, and the variables 

j{t) := max{j = 0,..., n - 1 : B < t}, 

which are '^(j^iystopping times. 

We can then define the cr-algebras 

Gt ■■= Gj{t),TUit),t), 

which form a filtration in t, and check that the process Bt is a ^^-Brownian 
motion. We conclude that given Bj-measurable variables ct, the sums 

= (14) 

where the ^^-predictable integrands 

fcit) ■■= E"=o Cjl(o,Tj]{Tij,t)). 

In part (i), we consider the case Q = 1, and obtain 
t:=oX, = b^, z.:=Er=oB. 

Defining the random variables 

^:=Bi, r]:=B^-Bi, 

we then have ^ ~ 1) given Fq. 
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Furthermore, using Burkholder-Davis-Gundy, we have 


E[|r?|^"| J-o]=0(1)E[|z.-1|2"| J-q], 

while using Lemma 9(i), 

E[exp(u7y — ^u^\v — 1|) | J^o] < 1- 

It thus remains to bound the distance of v from 1. 

For j = 0,..., n, define the -martingale 

VS- := - nn I H), 

and the total mean 

We then have 

|z/ - 1| < \Vn\ + \v - 11; 

we will show that both terms on the right-hand side are small. 
We first obtain 

n\Vn\^" I -^o] = o(iM(Er=oVi - E[Ti I I -^o], 

by Burkholder-Davis-Gundy, 

= 0(n"-') Er=o IE[|ri - E[r, | \ J-q], 

by Jensen’s inequality, 

= OK-')Er=oIE[|r.|^E-^o] 

= oK-')Er=oiE[iJf*rE-^o], 

by Burkholder-Davis-Gundy and Doob’s martingale inequality, 

= 0{n-Y- 

We also have that 

^=Er=oE[x2|-F.], 

by Ito’s isometry, 

as the Bj^t are independent of F given JS'+i- VVe deduce that 

E[|j/- 1|2"| Jo]=0(n-"), 


(15) 
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as required. 

In part (ii), we again apply Burkholder-Davis-Gundy and Lemma 9(i) 
to the sums (14). We claim that 

/o°° /c (*) dt<A + M, 

for terms A and M as in the statement of the Lemma, so 

SUPcIEilUcI'^'" I -^o] = 0(1), 

and 

sup^E[exp(ttUc — ^u^{A + M)) \ J^o] < 1- 

It thus remains to prove the claim. 

As before, we have 

H < |14| + |i^|, 

and 

= 0(n'-02-)(^--o'E[|W|"" I J-,])V2-, 
by Jensen’s inequality, 

= 0 ( 1 ). 

For any random variables Cj = 0(1), we deduce that 

/o°° fc{t) dt = J27=o cfn = 0{v) = 0(1 + |14|). 

The claim thus holds for terms A = 0(1), M = 0(|1^|), and we further 
have that M satisfies (8), using (15). □ 

We next prove our result on combining exponential moment bounds. 

Proof of Lemma 2. We first note that by rescaling the W, we may assume 
= 1. Then on an extended probability space, let ^ be standard Gaussian, 
independent of T”. For any i? > 0, we have 

E[exp{Xf /4R)1m<r] 

= E[E[exp((2J?)-V2x,e)lM<R I J^]] 

< E[exp(^V4)E[exp((2i?)-V2^Xi - l{2R)-^fM)lM<R \ C]] 

= 0(l)E[exp(eV4)] 

= 0 ( 1 ). 

We deduce that, for any R > 0, 

E[maxiexp(X2/4J?)lM<i?] < Y^iE[exp{Xf /4R)1m<r] = 0(n), 
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so 

maxi \Xi\lM<R = Op{R^/‘^ log(n)^/2^. 

Since M = Op(l), we conclude that max* |Xj| = Op (log(n)^/^). □ 

We continue with a proof of our moment bounds on the Yi and Zi{6). 

Proof of Lemma 3. To show part (i) , we note that the functions ja and 
are locally Lipschitz, cr^ is positive, and 9, L and Xi are bounded. We may 
thus restrict the functions /r and to a compact set, on which /i and cr^ 
are C^, and l/cr^ is bounded. We deduce that part (i) holds; by a similar 
argument, part (ii) holds also. 

To show part (in), we then have 

E[\Z,{9)f+^ I Xt,] = 0(1)E[1 + |y/+^ I Xt,] = 0(1), 

and 

Zi{e) = (Yi - fr{9,ti,XtJ)/a{9,ti,Xt,)+ji, 

for a term 

= o(i)(i + |y,i)||w-wj|. 

Using Cauchy-Schwarz, we obtain 

EN I J^u] = 0(n-'/2), 

E[7f I J^u] = 0{n-^/^). 

We conclude that 

E[zm\^u] = StM + nx\J^u], 

= StM + 0{n-^/% 

and under Hq, using Cauchy-Schwarz, also 

E[Z,i9of I Xu] = 1 + 0(l)(E[7f I Xu]^/^+E[^f \ X^]), 

= l + 0(n-^/^). 

To show part (iv), we define the random variables 

Rk ■■= m-^ EtuZ'm - I J^u]? - Var[y, | X,]), 

where m := n(sfc+i — Sk)- Rk is then an average of m terms of a martingale 
difference sequence, whose conditional variances are bounded. We deduce 
that 

E[Rl] = O(m-i) = 0(n-V2), 
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and so 


E[(maxfc n ^ y.2)2] = 0(i)E[l + maxfc Rl] 

= 0{1){1 + Ekml]) 
= 0 ( 1 ). 


The desired result follows. □ 

Finally, we prove our result on the behaviour of the processes St{0) under 
Assumption 2. 

Proof of Lemma 4- We begin by defining the processes St{9), St{0), and 
times Tj. We can split the process nt into parts 

Lt = L't + 

where fit is a process with jumps of size at most log(n)^/^, and /Ij is 

an orthogonal pure-jump process with jumps of size at least log(n)^/^. 

We can similarly define terms W, Xt- 

Let Ti < • • • < tm-i denote the times at which or Xt jump, and set 
To := 0, Tat := 1. We can then decompose the processes 

St{e) = St{e)+Mo), (le) 

where 

letting ASt{0) denote the jump in St{0) at time t, and St{9) is then defined 
by (16). 

To prove part (i), we first note that the model functions fi and cr^ are 
continuously differentiable in t, twice continuously differentiable in X, and 
is positive. By Itb’s lemma, we can thus write 

dSt{9) = bt{9) dt + ct{9)'^ dBt + /jj ft{x, 9) {X{dx, dt) - dx dt), 

for integrators Bt and X{dx, dt) given by Assumption 2, predictable processes 
bt{9),ct{9), and predictable functions ft{x,9f. Since 9, t, fit and Xt are 
bounded, we also have bt{9),ct{9) = 0(1), ft{x,9) = 0(n“^0 iog(n)^/2), 
lR\ft{x,9)\dx = 0{1). 

To bound the size of changes in St{9), we will consider the variables 
Mk{9) := supt^iJSt{9) - S2-Jk{9)\, 
where the intervals Ik '■= 2~'^[k, A: + 1]. We have 

Mk{9)<TZlMkM, 
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for terms 


MkM 


' i = 0, 

' ^^Vt&ik\!l-Jk^i,t{Q) dBi^t\, z = 1,..., g + 1, 

,supig 7 j/ 2 _j;i./R/i(x, 6 ') {X{dx,dt) -dxdt)\, i = q + 2. 


In each case z = 0,..., g + 2, we will bound the maximum 
Mi := maxfc,0e0^ Mk^i{9). 


From the definitions, we have 


Mq = 0(n-^/2). 


For z = 1,..., g + 1, we use Lemma 9(ii), obtaining that 
E[exp(un^/^Mfc,i( 6 <) — u^R)] = 0(1), 
for all u E M, and some fixed i? > 0. Using Lemma 2, we deduce that 

Mi = Op(n“^/^log(n)^/^). 

Finally, using Lemma 9(iii), for small enough e' > 0 we have 
E[exp(eV/^log(n)^/^Mfc^q+2(6'))] = 0(1). 

We deduce that 

E[exp(e'n^/‘^ log(n)^/2^g+2)] < Ylk,eee„ IE[exp(e'ni/^ log(n)^/2^fc,5+2(6'))] 

= 0 (n'"+^/^), 


and so 

M 5+2 = Op(n"''/^log(n)^/^). 

We conclude that the random variable 

M := maxfc,0e0^ Mk{0) < Yll=o = Op(n“^/^ log(n)^/^). 


Part (i) then follows trivially. 

To show part (ii), for s,t E [0,1], define the translated processes 
5 ^( 0 ) ■.= Ss{e)-St{e). 


We then have 

RjStiO) = RjSf\e), 
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since wavelets are orthogonal to constant functions, 


= -Pjsf\e\ 


since sf\6) = 0, 


= - Efc fo <fJ,k(s)(Ss(d) - St{0)) ds 
= 0(M), 

using the compact support of if. The desired result follows. 

To show part (hi), we first note that the processes St{0) are constant on 
the intervals [rj,rj+i) and [rAr_i,rAr]. Setting r* = 1 for i > N, we then have 

P[3 i < : Ti < 1 - Tj+i < r* + 6n] 

^ Ei=0^ ^ IPh < 1 - <Ti + Sn\Ti<l-6n] 

= o{n-^/^)EtT^nn<l-Sn], 

using Assumption 2, and that r* is a stopping time, 

= 0 ( 1 ). 

Similarly, we have 

P[3i : r, € (1 - <5„, 1)], P[iV > = o(l). 

The desired result follows. □ 
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